Picture for Yuxin Zhang

Yuxin Zhang

Tony

GEN3D: Generating Domain-Free 3D Scenes from a Single Image

Add code
Nov 18, 2025
Viaarxiv icon

Test-Time Iterative Error Correction for Efficient Diffusion Models

Add code
Nov 09, 2025
Viaarxiv icon

Step-Audio-EditX Technical Report

Add code
Nov 05, 2025
Figure 1 for Step-Audio-EditX Technical Report
Figure 2 for Step-Audio-EditX Technical Report
Figure 3 for Step-Audio-EditX Technical Report
Figure 4 for Step-Audio-EditX Technical Report
Viaarxiv icon

Spotlight Attention: Towards Efficient LLM Generation via Non-linear Hashing-based KV Cache Retrieval

Add code
Aug 27, 2025
Figure 1 for Spotlight Attention: Towards Efficient LLM Generation via Non-linear Hashing-based KV Cache Retrieval
Figure 2 for Spotlight Attention: Towards Efficient LLM Generation via Non-linear Hashing-based KV Cache Retrieval
Figure 3 for Spotlight Attention: Towards Efficient LLM Generation via Non-linear Hashing-based KV Cache Retrieval
Figure 4 for Spotlight Attention: Towards Efficient LLM Generation via Non-linear Hashing-based KV Cache Retrieval
Viaarxiv icon

Pandora: Leveraging Code-driven Knowledge Transfer for Unified Structured Knowledge Reasoning

Add code
Aug 25, 2025
Viaarxiv icon

Levarging Learning Bias for Noisy Anomaly Detection

Add code
Aug 10, 2025
Viaarxiv icon

DS$^2$Net: Detail-Semantic Deep Supervision Network for Medical Image Segmentation

Add code
Aug 06, 2025
Viaarxiv icon

TELEVAL: A Dynamic Benchmark Designed for Spoken Language Models in Chinese Interactive Scenarios

Add code
Jul 24, 2025
Viaarxiv icon

Step-Audio 2 Technical Report

Add code
Jul 24, 2025
Figure 1 for Step-Audio 2 Technical Report
Figure 2 for Step-Audio 2 Technical Report
Figure 3 for Step-Audio 2 Technical Report
Figure 4 for Step-Audio 2 Technical Report
Viaarxiv icon

GS-Bias: Global-Spatial Bias Learner for Single-Image Test-Time Adaptation of Vision-Language Models

Add code
Jul 16, 2025
Viaarxiv icon